Overview

Dataset statistics

Number of variables21
Number of observations1628
Missing cells0
Missing cells (%)0.0%
Duplicate rows628
Duplicate rows (%)38.6%
Total size in memory960.3 KiB
Average record size in memory604.0 B

Variable types

NUM11
CAT8
BOOL2

Reproduction

Analysis started2020-06-06 06:13:32.936572
Analysis finished2020-06-06 06:13:54.144493
Duration21.21 seconds
Versionpandas-profiling v2.7.1
Command linepandas_profiling --config_file config.yaml [YOUR_FILE.csv]
Download configurationconfig.yaml
Dataset has 628 (38.6%) duplicate rows Duplicates
NumCompaniesWorked has 198 (12.2%) zeros Zeros
PropCurrMgrCompYears has 414 (25.4%) zeros Zeros
PropCurrRoleCompYears has 361 (22.2%) zeros Zeros

Variables

Age
Real number (ℝ≥0)

Distinct count43
Unique (%)2.6%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean35.6455773955774
Minimum18
Maximum60
Zeros0
Zeros (%)0.0%
Memory size12.8 KiB

Quantile statistics

Minimum18
5-th percentile21
Q129
median34
Q342
95-th percentile53
Maximum60
Range42
Interquartile range (IQR)13

Descriptive statistics

Standard deviation9.481794355
Coefficient of variation (CV)0.2660019853
Kurtosis-0.4843929692
Mean35.6455774
Median Absolute Deviation (MAD)6
Skewness0.4315411626
Sum58031
Variance89.90442419
Histogram with fixed size bins (bins=10)
ValueCountFrequency (%) 
31 92 5.7%
 
29 85 5.2%
 
35 80 4.9%
 
34 73 4.5%
 
30 70 4.3%
 
26 68 4.2%
 
32 65 4.0%
 
33 60 3.7%
 
28 58 3.6%
 
36 55 3.4%
 
Other values (33) 922 56.6%
 
ValueCountFrequency (%) 
18 13 0.8%
 
19 23 1.4%
 
20 27 1.7%
 
21 30 1.8%
 
22 13 0.8%
 
ValueCountFrequency (%) 
60 3 0.2%
 
59 8 0.5%
 
58 15 0.9%
 
57 3 0.2%
 
56 9 0.6%
 

Attrition
Boolean

Distinct count2
Unique (%)0.1%
Missing0
Missing (%)0.0%
Memory size12.8 KiB
0
843
1
785
ValueCountFrequency (%) 
0 843 51.8%
 
1 785 48.2%
 

BusinessTravel
Categorical

Distinct count3
Unique (%)0.2%
Missing0
Missing (%)0.0%
Memory size12.8 KiB
Travel_Rarely
1105
Travel_Frequently
403
Non-Travel
 
120
ValueCountFrequency (%) 
Travel_Rarely 1105 67.9%
 
Travel_Frequently 403 24.8%
 
Non-Travel 120 7.4%
 

Length

Max length17
Mean length13.76904177
Min length10
ValueCountFrequency (%) 
Lowercase_Letter 11 64.7%
 
Uppercase_Letter 4 23.5%
 
Dash_Punctuation 1 5.9%
 
Connector_Punctuation 1 5.9%
 
ValueCountFrequency (%) 
Latin 15 88.2%
 
Common 2 11.8%
 
ValueCountFrequency (%) 
ASCII 17 100.0%
 

DistanceFromHome
Real number (ℝ≥0)

Distinct count29
Unique (%)1.8%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean9.735257985257986
Minimum1
Maximum29
Zeros0
Zeros (%)0.0%
Memory size12.8 KiB

Quantile statistics

Minimum1
5-th percentile1
Q12
median8
Q315
95-th percentile26
Maximum29
Range28
Interquartile range (IQR)13

Descriptive statistics

Standard deviation8.306546029
Coefficient of variation (CV)0.8532435445
Kurtosis-0.4260285132
Mean9.735257985
Median Absolute Deviation (MAD)6
Skewness0.8685624445
Sum15849
Variance68.99870694
Histogram with fixed size bins (bins=10)
ValueCountFrequency (%) 
1 214 13.1%
 
2 211 13.0%
 
9 129 7.9%
 
10 90 5.5%
 
3 84 5.2%
 
7 82 5.0%
 
4 77 4.7%
 
5 76 4.7%
 
8 76 4.7%
 
6 56 3.4%
 
Other values (19) 533 32.7%
 
ValueCountFrequency (%) 
1 214 13.1%
 
2 211 13.0%
 
3 84 5.2%
 
4 77 4.7%
 
5 76 4.7%
 
ValueCountFrequency (%) 
29 41 2.5%
 
28 25 1.5%
 
27 13 0.8%
 
26 26 1.6%
 
25 33 2.0%
 

EducationField
Categorical

Distinct count6
Unique (%)0.4%
Missing0
Missing (%)0.0%
Memory size12.8 KiB
Life Sciences
623
Medical
521
Marketing
197
Technical Degree
162
Other
 
85
ValueCountFrequency (%) 
Life Sciences 623 38.3%
 
Medical 521 32.0%
 
Marketing 197 12.1%
 
Technical Degree 162 10.0%
 
Other 85 5.2%
 
Human Resources 40 2.5%
 

Length

Max length16
Mean length10.52579853
Min length5
ValueCountFrequency (%) 
Lowercase_Letter 17 65.4%
 
Uppercase_Letter 8 30.8%
 
Space_Separator 1 3.8%
 
ValueCountFrequency (%) 
Latin 25 96.2%
 
Common 1 3.8%
 
ValueCountFrequency (%) 
ASCII 26 100.0%
 

EmployeeNumber
Real number (ℝ≥0)

Distinct count1000
Unique (%)61.4%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean1000.9858722358722
Minimum1
Maximum2068
Zeros0
Zeros (%)0.0%
Memory size12.8 KiB

Quantile statistics

Minimum1
5-th percentile90.35
Q1509.25
median977
Q31494
95-th percentile1953.95
Maximum2068
Range2067
Interquartile range (IQR)984.75

Descriptive statistics

Standard deviation585.4176938
Coefficient of variation (CV)0.5848411152
Kurtosis-1.136645074
Mean1000.985872
Median Absolute Deviation (MAD)491
Skewness0.08723216791
Sum1629605
Variance342713.8763
Histogram with fixed size bins (bins=10)
ValueCountFrequency (%) 
1 5 0.3%
 
514 5 0.3%
 
1486 5 0.3%
 
478 5 0.3%
 
485 5 0.3%
 
1467 5 0.3%
 
1457 5 0.3%
 
488 5 0.3%
 
492 5 0.3%
 
494 5 0.3%
 
Other values (990) 1578 96.9%
 
ValueCountFrequency (%) 
1 5 0.3%
 
2 1 0.1%
 
4 5 0.3%
 
5 1 0.1%
 
8 1 0.1%
 
ValueCountFrequency (%) 
2068 1 0.1%
 
2062 1 0.1%
 
2061 1 0.1%
 
2060 1 0.1%
 
2057 1 0.1%
 
Distinct count4
Unique (%)0.2%
Missing0
Missing (%)0.0%
Memory size12.8 KiB
3
480
4
472
1
372
2
304
ValueCountFrequency (%) 
3 480 29.5%
 
4 472 29.0%
 
1 372 22.9%
 
2 304 18.7%
 

Length

Max length1
Mean length1
Min length1
ValueCountFrequency (%) 
Decimal_Number 4 100.0%
 
ValueCountFrequency (%) 
Common 4 100.0%
 
ValueCountFrequency (%) 
ASCII 4 100.0%
 

JobInvolvement
Categorical

Distinct count4
Unique (%)0.2%
Missing0
Missing (%)0.0%
Memory size12.8 KiB
3
925
2
447
4
 
130
1
 
126
ValueCountFrequency (%) 
3 925 56.8%
 
2 447 27.5%
 
4 130 8.0%
 
1 126 7.7%
 

Length

Max length1
Mean length1
Min length1
ValueCountFrequency (%) 
Decimal_Number 4 100.0%
 
ValueCountFrequency (%) 
Common 4 100.0%
 
ValueCountFrequency (%) 
ASCII 4 100.0%
 

JobRole
Categorical

Distinct count9
Unique (%)0.6%
Missing0
Missing (%)0.0%
Memory size12.8 KiB
Sales Executive
365
Research Scientist
341
Laboratory Technician
310
Sales Representative
172
Manufacturing Director
121
Other values (4)
319
ValueCountFrequency (%) 
Sales Executive 365 22.4%
 
Research Scientist 341 20.9%
 
Laboratory Technician 310 19.0%
 
Sales Representative 172 10.6%
 
Manufacturing Director 121 7.4%
 
Healthcare Representative 110 6.8%
 
Manager 90 5.5%
 
Human Resources 72 4.4%
 
Research Director 47 2.9%
 

Length

Max length25
Mean length18.11056511
Min length7
ValueCountFrequency (%) 
Lowercase_Letter 20 69.0%
 
Uppercase_Letter 8 27.6%
 
Space_Separator 1 3.4%
 
ValueCountFrequency (%) 
Latin 28 96.6%
 
Common 1 3.4%
 
ValueCountFrequency (%) 
ASCII 29 100.0%
 

JobSatisfaction
Categorical

Distinct count4
Unique (%)0.2%
Missing0
Missing (%)0.0%
Memory size12.8 KiB
3
529
4
434
1
356
2
309
ValueCountFrequency (%) 
3 529 32.5%
 
4 434 26.7%
 
1 356 21.9%
 
2 309 19.0%
 

Length

Max length1
Mean length1
Min length1
ValueCountFrequency (%) 
Decimal_Number 4 100.0%
 
ValueCountFrequency (%) 
Common 4 100.0%
 
ValueCountFrequency (%) 
ASCII 4 100.0%
 

NumCompaniesWorked
Real number (ℝ≥0)

ZEROS
Distinct count10
Unique (%)0.6%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean2.7616707616707616
Minimum0
Maximum9
Zeros198
Zeros (%)12.2%
Memory size12.8 KiB

Quantile statistics

Minimum0
5-th percentile0
Q11
median2
Q34
95-th percentile8
Maximum9
Range9
Interquartile range (IQR)3

Descriptive statistics

Standard deviation2.54999527
Coefficient of variation (CV)0.92335238
Kurtosis-0.1540142041
Mean2.761670762
Median Absolute Deviation (MAD)1
Skewness0.9859588452
Sum4496
Variance6.502475879
Histogram with fixed size bins (bins=10)
ValueCountFrequency (%) 
1 607 37.3%
 
0 198 12.2%
 
3 154 9.5%
 
2 152 9.3%
 
4 146 9.0%
 
6 98 6.0%
 
7 90 5.5%
 
5 70 4.3%
 
9 67 4.1%
 
8 46 2.8%
 
ValueCountFrequency (%) 
0 198 12.2%
 
1 607 37.3%
 
2 152 9.3%
 
3 154 9.5%
 
4 146 9.0%
 
ValueCountFrequency (%) 
9 67 4.1%
 
8 46 2.8%
 
7 90 5.5%
 
6 98 6.0%
 
5 70 4.3%
 

OverTime
Boolean

Distinct count2
Unique (%)0.1%
Missing0
Missing (%)0.0%
Memory size12.8 KiB
No
1000
Yes
628
ValueCountFrequency (%) 
No 1000 61.4%
 
Yes 628 38.6%
 

PercentSalaryHike
Real number (ℝ≥0)

Distinct count15
Unique (%)0.9%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean15.207616707616708
Minimum11
Maximum25
Zeros0
Zeros (%)0.0%
Memory size12.8 KiB

Quantile statistics

Minimum11
5-th percentile11
Q112
median14
Q318
95-th percentile22
Maximum25
Range14
Interquartile range (IQR)6

Descriptive statistics

Standard deviation3.686703092
Coefficient of variation (CV)0.2424247772
Kurtosis-0.2771366513
Mean15.20761671
Median Absolute Deviation (MAD)2
Skewness0.8311232246
Sum24758
Variance13.59177969
Histogram with fixed size bins (bins=10)
ValueCountFrequency (%) 
11 243 14.9%
 
12 222 13.6%
 
13 219 13.5%
 
14 199 12.2%
 
15 128 7.9%
 
16 103 6.3%
 
17 100 6.1%
 
18 93 5.7%
 
22 68 4.2%
 
19 68 4.2%
 
Other values (5) 185 11.4%
 
ValueCountFrequency (%) 
11 243 14.9%
 
12 222 13.6%
 
13 219 13.5%
 
14 199 12.2%
 
15 128 7.9%
 
ValueCountFrequency (%) 
25 18 1.1%
 
24 29 1.8%
 
23 32 2.0%
 
22 68 4.2%
 
21 56 3.4%
 

StockOptionLevel
Categorical

Distinct count4
Unique (%)0.2%
Missing0
Missing (%)0.0%
Memory size12.8 KiB
0
836
1
557
2
 
135
3
 
100
ValueCountFrequency (%) 
0 836 51.4%
 
1 557 34.2%
 
2 135 8.3%
 
3 100 6.1%
 

Length

Max length1
Mean length1
Min length1
ValueCountFrequency (%) 
Decimal_Number 4 100.0%
 
ValueCountFrequency (%) 
Common 4 100.0%
 
ValueCountFrequency (%) 
ASCII 4 100.0%
 

TotalWorkingYears
Real number (ℝ≥0)

Distinct count39
Unique (%)2.4%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean9.949017199017199
Minimum0
Maximum38
Zeros15
Zeros (%)0.9%
Memory size12.8 KiB

Quantile statistics

Minimum0
5-th percentile1
Q15
median8
Q313
95-th percentile25
Maximum38
Range38
Interquartile range (IQR)8

Descriptive statistics

Standard deviation7.482935655
Coefficient of variation (CV)0.7521281253
Kurtosis1.147729867
Mean9.949017199
Median Absolute Deviation (MAD)4
Skewness1.169746211
Sum16197
Variance55.99432602
Histogram with fixed size bins (bins=10)
ValueCountFrequency (%) 
10 185 11.4%
 
1 160 9.8%
 
6 135 8.3%
 
8 125 7.7%
 
7 109 6.7%
 
5 98 6.0%
 
9 90 5.5%
 
4 79 4.9%
 
2 61 3.7%
 
3 59 3.6%
 
Other values (29) 527 32.4%
 
ValueCountFrequency (%) 
0 15 0.9%
 
1 160 9.8%
 
2 61 3.7%
 
3 59 3.6%
 
4 79 4.9%
 
ValueCountFrequency (%) 
38 1 0.1%
 
37 4 0.2%
 
36 3 0.2%
 
35 2 0.1%
 
34 9 0.6%
 

CommunicationSkill
Real number (ℝ≥0)

Distinct count5
Unique (%)0.3%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean3.135749385749386
Minimum1
Maximum5
Zeros0
Zeros (%)0.0%
Memory size12.8 KiB

Quantile statistics

Minimum1
5-th percentile1
Q12
median3
Q34
95-th percentile5
Maximum5
Range4
Interquartile range (IQR)2

Descriptive statistics

Standard deviation1.408770357
Coefficient of variation (CV)0.4492611442
Kurtosis-1.291879611
Mean3.135749386
Median Absolute Deviation (MAD)1
Skewness-0.1069868546
Sum5105
Variance1.984633919
Histogram with fixed size bins (bins=10)
ValueCountFrequency (%) 
5 375 23.0%
 
4 342 21.0%
 
2 325 20.0%
 
3 313 19.2%
 
1 273 16.8%
 
ValueCountFrequency (%) 
1 273 16.8%
 
2 325 20.0%
 
3 313 19.2%
 
4 342 21.0%
 
5 375 23.0%
 
ValueCountFrequency (%) 
5 375 23.0%
 
4 342 21.0%
 
3 313 19.2%
 
2 325 20.0%
 
1 273 16.8%
 

YearsToCompanies
Real number (ℝ≥0)

Distinct count5
Unique (%)0.3%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean2.095823095823096
Minimum1
Maximum5
Zeros0
Zeros (%)0.0%
Memory size12.8 KiB

Quantile statistics

Minimum1
5-th percentile1
Q11
median2
Q33
95-th percentile5
Maximum5
Range4
Interquartile range (IQR)2

Descriptive statistics

Standard deviation1.23984544
Coefficient of variation (CV)0.591579243
Kurtosis0.03728458802
Mean2.095823096
Median Absolute Deviation (MAD)1
Skewness0.9995105561
Sum3412
Variance1.537216716
Histogram with fixed size bins (bins=10)
ValueCountFrequency (%) 
1 701 43.1%
 
2 418 25.7%
 
3 292 17.9%
 
5 131 8.0%
 
4 86 5.3%
 
ValueCountFrequency (%) 
1 701 43.1%
 
2 418 25.7%
 
3 292 17.9%
 
4 86 5.3%
 
5 131 8.0%
 
ValueCountFrequency (%) 
5 131 8.0%
 
4 86 5.3%
 
3 292 17.9%
 
2 418 25.7%
 
1 701 43.1%
 

PropCurrMgrCompYears
Real number (ℝ≥0)

ZEROS
Distinct count104
Unique (%)6.4%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean0.4231117544340891
Minimum0.0
Maximum0.8947368421052632
Zeros414
Zeros (%)25.4%
Memory size12.8 KiB

Quantile statistics

Minimum0
5-th percentile0
Q10
median0.5
Q30.6666666667
95-th percentile0.8181818182
Maximum0.8947368421
Range0.8947368421
Interquartile range (IQR)0.6666666667

Descriptive statistics

Standard deviation0.291229786
Coefficient of variation (CV)0.6883046452
Kurtosis-1.276010581
Mean0.4231117544
Median Absolute Deviation (MAD)0.1666666667
Skewness-0.3523783188
Sum688.8259362
Variance0.08481478826
Histogram with fixed size bins (bins=10)
ValueCountFrequency (%) 
0 414 25.4%
 
0.5 224 13.8%
 
0.6666666667 210 12.9%
 
0.7777777778 67 4.1%
 
0.6 62 3.8%
 
0.4 61 3.7%
 
0.3333333333 54 3.3%
 
0.875 53 3.3%
 
0.6363636364 46 2.8%
 
0.7272727273 38 2.3%
 
Other values (94) 399 24.5%
 
ValueCountFrequency (%) 
0 414 25.4%
 
0.05882352941 1 0.1%
 
0.06666666667 1 0.1%
 
0.08333333333 2 0.1%
 
0.09090909091 3 0.2%
 
ValueCountFrequency (%) 
0.8947368421 1 0.1%
 
0.875 53 3.3%
 
0.8571428571 3 0.2%
 
0.8461538462 1 0.1%
 
0.8333333333 6 0.4%
 

PropAgeCompYears
Real number (ℝ≥0)

Distinct count5
Unique (%)0.3%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean2.9926289926289926
Minimum1
Maximum5
Zeros0
Zeros (%)0.0%
Memory size12.8 KiB

Quantile statistics

Minimum1
5-th percentile1
Q12
median3
Q34
95-th percentile5
Maximum5
Range4
Interquartile range (IQR)2

Descriptive statistics

Standard deviation1.417233371
Coefficient of variation (CV)0.473574698
Kurtosis-1.309303646
Mean2.992628993
Median Absolute Deviation (MAD)1
Skewness0.01302808333
Sum4872
Variance2.008550429
Histogram with fixed size bins (bins=10)
ValueCountFrequency (%) 
2 336 20.6%
 
5 326 20.0%
 
1 326 20.0%
 
4 324 19.9%
 
3 316 19.4%
 
ValueCountFrequency (%) 
1 326 20.0%
 
2 336 20.6%
 
3 316 19.4%
 
4 324 19.9%
 
5 326 20.0%
 
ValueCountFrequency (%) 
5 326 20.0%
 
4 324 19.9%
 
3 316 19.4%
 
2 336 20.6%
 
1 326 20.0%
 
Distinct count4
Unique (%)0.2%
Missing0
Missing (%)0.0%
Memory size12.8 KiB
2
506
1
437
3
431
4
254
ValueCountFrequency (%) 
2 506 31.1%
 
1 437 26.8%
 
3 431 26.5%
 
4 254 15.6%
 

Length

Max length1
Mean length1
Min length1
ValueCountFrequency (%) 
Decimal_Number 4 100.0%
 
ValueCountFrequency (%) 
Common 4 100.0%
 
ValueCountFrequency (%) 
ASCII 4 100.0%
 

PropCurrRoleCompYears
Real number (ℝ≥0)

ZEROS
Distinct count102
Unique (%)6.3%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean0.4424320887163953
Minimum0.0
Maximum0.875
Zeros361
Zeros (%)22.2%
Memory size12.8 KiB

Quantile statistics

Minimum0
5-th percentile0
Q10.1818181818
median0.5
Q30.6666666667
95-th percentile0.8333333333
Maximum0.875
Range0.875
Interquartile range (IQR)0.4848484848

Descriptive statistics

Standard deviation0.288742572
Coefficient of variation (CV)0.6526257461
Kurtosis-1.193376378
Mean0.4424320887
Median Absolute Deviation (MAD)0.1923076923
Skewness-0.4056759141
Sum720.2794404
Variance0.08337227289
Histogram with fixed size bins (bins=10)
ValueCountFrequency (%) 
0 361 22.2%
 
0.6666666667 243 14.9%
 
0.5 215 13.2%
 
0.6 74 4.5%
 
0.3333333333 71 4.4%
 
0.875 62 3.8%
 
0.7777777778 58 3.6%
 
0.4 50 3.1%
 
0.7 41 2.5%
 
0.5714285714 33 2.0%
 
Other values (92) 420 25.8%
 
ValueCountFrequency (%) 
0 361 22.2%
 
0.0625 1 0.1%
 
0.06666666667 5 0.3%
 
0.08695652174 1 0.1%
 
0.1071428571 1 0.1%
 
ValueCountFrequency (%) 
0.875 62 3.8%
 
0.8666666667 1 0.1%
 
0.8571428571 7 0.4%
 
0.85 1 0.1%
 
0.8461538462 3 0.2%
 

Interactions

Correlations

Pearson's r

The Pearson's correlation coefficient (r) is a measure of linear correlation between two variables. It's value lies between -1 and +1, -1 indicating total negative linear correlation, 0 indicating no linear correlation and 1 indicating total positive linear correlation. Furthermore, r is invariant under separate changes in location and scale of the two variables, implying that for a linear function the angle to the x-axis does not affect r.

To calculate r for two variables X and Y, one divides the covariance of X and Y by the product of their standard deviations.

Spearman's ρ

The Spearman's rank correlation coefficient (ρ) is a measure of monotonic correlation between two variables, and is therefore better in catching nonlinear monotonic correlations than Pearson's r. It's value lies between -1 and +1, -1 indicating total negative monotonic correlation, 0 indicating no monotonic correlation and 1 indicating total positive monotonic correlation.

To calculate ρ for two variables X and Y, one divides the covariance of the rank variables of X and Y by the product of their standard deviations.

Kendall's τ

Similarly to Spearman's rank correlation coefficient, the Kendall rank correlation coefficient (τ) measures ordinal association between two variables. It's value lies between -1 and +1, -1 indicating total negative correlation, 0 indicating no correlation and 1 indicating total positive correlation.

To calculate τ for two variables X and Y, one determines the number of concordant and discordant pairs of observations. τ is given by the number of concordant pairs minus the discordant pairs divided by the total number of pairs.

Phik (φk)

Phik (φk) is a new and practical correlation coefficient that works consistently between categorical, ordinal and interval variables, captures non-linear dependency and reverts to the Pearson correlation coefficient in case of a bivariate normal input distribution. There is extensive documentation available here.

Cramér's V (φc)

Cramér's V is an association measure for nominal random variables. The coefficient ranges from 0 to 1, with 0 indicating independence and 1 indicating perfect association. The empirical estimators used for Cramér's V have been proved to be biased, even for large samples. We use a bias-corrected measure that has been proposed by Bergsma in 2013 that can be found here.

Missing values

Sample

First rows

AgeAttritionBusinessTravelDistanceFromHomeEducationFieldEmployeeNumberEnvironmentSatisfactionJobInvolvementJobRoleJobSatisfactionNumCompaniesWorkedOverTimePercentSalaryHikeStockOptionLevelTotalWorkingYearsCommunicationSkillYearsToCompaniesPropCurrMgrCompYearsPropAgeCompYearsPropTrainCompYearsPropCurrRoleCompYears
0300Non-Travel2Medical57133Laboratory Technician40No14012450.583333510.583333
1360Travel_Rarely12Life Sciences161433Manufacturing Director39Yes1227210.250000220.500000
2551Travel_Rarely2Medical84233Sales Executive44No16012520.300000420.700000
3390Travel_Rarely24Life Sciences201413Research Scientist47No13018420.875000410.875000
4370Travel_Rarely3Other68933Manufacturing Director31No15110130.727273410.636364
5310Travel_Rarely7Life Sciences94122Sales Representative33No15013220.250000410.875000
6321Travel_Rarely1Life Sciences33142Laboratory Technician30Yes1404120.500000220.500000
7330Travel_Rarely4Medical150212Laboratory Technician28No1108510.333333330.666667
8350Travel_Frequently11Marketing113743Sales Executive41No1115420.333333320.333333
9211Travel_Rarely7Marketing178023Sales Representative21No1301510.000000240.000000

Last rows

AgeAttritionBusinessTravelDistanceFromHomeEducationFieldEmployeeNumberEnvironmentSatisfactionJobInvolvementJobRoleJobSatisfactionNumCompaniesWorkedOverTimePercentSalaryHikeStockOptionLevelTotalWorkingYearsCommunicationSkillYearsToCompaniesPropCurrMgrCompYearsPropAgeCompYearsPropTrainCompYearsPropCurrRoleCompYears
1618291Travel_Rarely9Marketing175221Sales Representative21No1302210.666667230.666667
1619261Travel_Rarely8Technical Degree79642Sales Executive16No1706510.400000320.600000
1620331Travel_Frequently3Life Sciences70213Research Scientist11Yes11010430.636364520.727273
1621201Travel_Rarely10Medical70143Research Scientist31Yes1101410.500000240.000000
1622491Travel_Rarely11Marketing84033Sales Executive41No1829230.700000420.800000
1623421Travel_Frequently19Medical75234Research Scientist36Yes1207310.666667230.666667
1624551Travel_Rarely2Medical84233Sales Executive44No16012520.300000420.700000
1625251Travel_Rarely9Life Sciences143912Sales Representative13No1206510.500000320.500000
1626291Travel_Rarely13Human Resources184412Human Resources14Yes1534510.000000230.666667
1627291Travel_Rarely18Medical31532Research Scientist41Yes1304210.200000330.600000

Duplicate rows

Most frequent

AgeAttritionBusinessTravelDistanceFromHomeEducationFieldEmployeeNumberEnvironmentSatisfactionJobInvolvementJobRoleJobSatisfactionNumCompaniesWorkedOverTimePercentSalaryHikeStockOptionLevelTotalWorkingYearsCommunicationSkillYearsToCompaniesPropCurrMgrCompYearsPropAgeCompYearsPropTrainCompYearsPropCurrRoleCompYearscount
0181Non-Travel8Medical115633Laboratory Technician31No1200410.000000110.0000005
1181Travel_Frequently5Marketing61423Sales Representative21Yes1400210.000000140.0000005
2191Non-Travel10Medical124812Research Scientist21Yes2501510.000000230.5000005
3191Travel_Frequently1Technical Degree23531Sales Representative10No2101310.000000140.0000005
4191Travel_Rarely2Technical Degree56612Human Resources41No1201410.000000240.0000005
5191Travel_Rarely21Other95942Sales Representative21Yes1301510.000000240.0000005
6201Travel_Frequently9Marketing107743Sales Representative41Yes1402110.666667330.6666675
7201Travel_Rarely2Medical92232Sales Representative31No1302410.666667330.6666675
8201Travel_Rarely4Technical Degree96013Laboratory Technician11No1901410.000000230.0000005
9201Travel_Rarely10Medical70143Research Scientist31Yes1101410.500000240.0000005